Feeds to Scour
SubscribedAll
Scoured 255313 posts in 2.51 s
Basics of Reinforcement Learning for LLMs
cameronrwolfe.substack.com·11h·
Discuss: Substack
📊Dynamic Programming
Preview
Report Post
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
reddit.com·23h·
📊Dynamic Programming
Preview
Report Post
Deep Reinforcement Learning: An Overview
paperium.net·2d·
Discuss: DEV
📊Dynamic Programming
Preview
Report Post
Learning General Policies with Policy Gradient Methods
arxiv.org·4d
📊Optimization
Preview
Report Post
Deep Reinforcement Learning: An Overview
dev.to·2d·
Discuss: DEV
📊Dynamic Programming
Preview
Report Post
Introducing the XLab AI Security Guide
lesswrong.com·7h
🛡️AI Security
Preview
Report Post
TIL every time you remember something, your brain slightly rewrites that memory instead of replaying it exactly
frontiersin.org·7h·
🎴Anki
Preview
Report Post
Building a Neural Network from scratch
pub.towardsai.net
·2d
📱Edge AI
Preview
Report Post
Chad Dorsey - Concord, Massachusetts, United States | Professional Profile
linkedin.com·1h
💬Prompt Engineering
Preview
Report Post
Human Processor Model
en.wikipedia.org·22h·
Discuss: Hacker News
📊Dynamic Programming
Preview
Report Post
Two-layer coordinated operation of multi-energy system considering carbon-oriented collaborative pricing mechanism via two-stage stochastic programming approach
sciencedirect.com·2d
📊Dynamic Programming
Preview
Report Post
🎲 Learning is about building personal context
simeongriggs.dev·1d
🎴Anki
Preview
Report Post
Claude's take on RLHF and self-doubt
future.forem.com·17h·
Discuss: DEV
🎴Anki
Preview
Report Post
This AI Paper from Stanford and Harvard Explains Why Most 'Agentic AI' Systems Feel Impressive in Demos and then Completely Fall Apart in Real Use
www-marktechpost-com.cdn.ampproject.org·1d
💬Prompt Engineering
Preview
Report Post
Chapter 2 The Targeted Learning Roadmap
tlverse.org·1d
🧠Machine Learning
Preview
Report Post
Hj Hornbeck
freethoughtblogs.com·16h
🌊CALM Theorem
Preview
Report Post
Self-Supervised Temporal Pattern Mining for circular manufacturing supply chains with embodied agent feedback loops
dev.to·2h·
Discuss: DEV
LMAX Disruptor
Preview
Report Post
Book Review: Why Machines Learn
philippdubach.com·1d·
Discuss: Hacker News
💬Prompt Engineering
Preview
Report Post
Training a Model on Multiple GPUs with Data Parallelism
machinelearningmastery.com·1d
🔥PyTorch
Preview
Report Post